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ABSTRACT 

Class II transfer RNAs (tRNAs), including tRNA Leu 
and tRNA Ser , have an additional stem and loop 
structure, the long variable arm (V-arm). Here, we 
describe Class II tRNAs with a unique anticodon 
corresponding to neither leucine nor serine. 
Because these tRNAs are specifically conserved 
among the nematodes, we have called them 
'nematode-specific V-arm-containing tRNAs' (nev- 
tRNAs). The expression of nev-tRNA genes in 
Caenorhabditis elegans was confirmed experimen- 
tally. A comparative sequence analysis suggested 
that the nev-tRNAs derived phylogenetically from 
tRNA Leu . In vitro aminoacylation assays showed 
that nev-tRNA Gly and nev-tRNA 1 ' 6 are only charged 
with leucine, which is inconsistent with their anti- 
codons. Furthermore, the deletion and mutation of 
crucial determinants for leucylation in nev-tRNA led 
to a marked loss of activity. An in vitro translation 
analysis showed that nev-tRNA Gly decodes GGG as 
leucine instead of the universal glycine code, 
indicating that nev-tRNAs can be incorporated into 
ribosomes and participate in protein biosynthesis. 
Our findings provide the first example of unexpected 
tRNAs that do not consistently obey the general 
translation rules for higher eukaryotes. 

INTRODUCTION 

Transfer RNAs (tRNAs) are small RNA molecules, of 
~70-85 nt, that play a critical role in protein biosynthesis 
as the links between the codons of messenger RNAs 
(mRNAs) and the amino acids in the growing polypeptide 
chains. tRNAs can be classified into two groups based on 
structural differences in their variable regions: Class I 



tRNAs have a short variable region of 4-5 nt, whereas 
Class II tRNAs (e.g. tRNA Leu , tRNA Ser and bacterial 
tRNA Tyr ) have a long variable arm (V-arm) structure con- 
taining >10nt (1). The attachment of the correct amino 
acid to the 3' end of each tRNA is catalyzed by an 
aminoacyl-tRNA synthetase (aaRS), which collectively 
comprise a protein family, with very high accuracy (2). 
The recognition elements for Class I tRNAs are mainly 
found in their acceptor stems and anticodon domains (3). 
In contrast, the recognition elements for Class II tRNAs 
occur in the acceptor stem, D-stem and long V-arm (4-7), 
with the anticodons generally less important for their 
aminoacylation (8). The long V-arms of tRNA Leu and 
tRNA Ser are conserved in all organisms and organelles, 
except animal mitochondria, whereas the long V-arms of 
archaeal and eukaryotic tRNA Tyr were probably lost soon 
after the separation of the domain Bacteria (9). Hence, the 
evolutionary divergence of Class II tRNAs occurred after 
the differentiation of the three domains of life. 

To understand the diversity of tRNA genes in living 
organisms, we analyzed the genome sequences of 
Archaea and primitive eukaryotes, and found that 
various types of tRNAs are encoded in a unique 
manner. These include: (i) intron-containing tRNAs, 
which contain a maximum of three introns located at 
various nucleotide positions, and are found predomin- 
antly in crenarchaeal species (10-13); (ii) tri-split tRNAs, 
comprising three transcripts of Ccddivirga maquilingensis 
(14); and (hi) permuted tRNAs, in which the 5' and 3' 
halves are encoded in a permuted orientation. These 
permuted tRNAs have been found in Cyanidioschyzon 
merolae and other unicellular red/green algae (15,16), 
and more recently, in the crenarchaeon Thermofilum 
pendens by Todd M. Lowe's group (17). The tRNA 
genes in higher eukaryotes were thought to contain a 
single intron at most, located in the canonical position 
(37/38), 1 nt downstream from the anticodon (13,18), but 
a comprehensive analysis of eukaryotic tRNA genes has 
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revealed two putative tRNA Gly and tRNA Ile genes (with 
Class I anticodons) in the Caenorhabditis elegans genome 
that harbor 15-1 6 nt V-arm structures (18). These 
tRNA-like sequences have features found only in typical 
Class II tRNAs, and their evolutionary origins and 
cellular functions remain unclear. 

To assess the evolutionary origins, diversity and func- 
tions of these putative tRNA genes, we comprehensively 
reanalyzed the eukaryotic tRNA genes and identified over 
100 Class II tRNA candidates with anticodons corres- 
ponding to neither leucine nor serine, but possessing the 
long V-arm. These sequences are specifically conserved 
among the nematodes and contain recognition elements 
for aaRSs similar to those of tRNA Leu . Surprisingly, 
we found that these tRNAs in C. elegans can only be 
aminoacylated with leucine, but recognize codons corres- 
ponding to other amino acids. Our findings identify a new 
type of tRNA that translates nucleotides in a manner that 
transgresses the genetic code, emphasizing the importance 
of considering the evolutionary processes underlying the 
general translation rules. 

MATERIALS AND METHODS 

Phylogenetic analysis of the tRNA genes of three 
nematode species 

All mature tRNA sequences, including nematode-specific 
V-arm-containing tRNAs (nev-tRNAs) in C. elegans and 
Caenorhabditis brenneri, were collected from the Genomic 
tRNA database (19), and the Caenorhabditis japonica and 
Pristionchus pacificus tRNA sequences were predicted 
using tRNAscan-SE (20). Any redundant tRNA se- 
quences (multicopy tRNAs with identical sequences) 
were discarded. A neighbor-joining tree of non-redundant 
mature tRNA sequences was constructed using ClustalX 
(21) and visualized with iTOL (22). 

Preparation of total RNA 

The N2 nematode strain and Escherichia coli strain OP50 
used in this work were provided by the Caenorhabditis 
Genetics Center, which is funded by the NIH National 
Center for Research Resources. Total RNA was isolated 
from mixed-stage C. elegans, including eggs, larvae 1—4 
and adults, using TRIzol Reagent (Invitrogen, Carlsbad, 
CA, USA), according to the manufacturer's instructions. 
For acid-urea polyacrylamide gel electrophoresis 
(PAGE)/northern blot analysis, the isolated RNA was 
incubated with 0.2 M Tris-HCl (pH 9.5) at 37°C for 2h 
to deacylate the charged tRNAs. 

Northern blot analysis 

Total RNA (10 ug per lane) was separated on a 
denaturing 6% polyacrylamide gel containing 8M urea 
and transferred onto Hybond-N+ membranes (GE 
Healthcare, Buckinghamshire, UK) by electroblotting. 
The membranes were hybridized with specific oligonucleo- 
tide probes (Supplementary Table S1A), labeled with a 
Biotin 3'-End DNA Labeling Kit (Pierce Biotechnology, 
Rockford, IL, USA) in ULTRAhyb-Oligo Hybridization 



Buffer (Ambion, Austiin, TX, USA) at room temperature. 
The hybridized membranes were treated with 5x Wash 
Buffer (Ambion) at room temperature. The non-isotopic 
blots were detected with the BrightStar BioDetect 
Kit (Ambion) using the Enhanced ChemiFluorescence 
(ECF) substrate (GE Healthcare). The images were 
visualized with a Molecular Imager FX Pro (Bio-Rad 
Laboratories, Hercules, CA, USA). 

RT-PCR analysis and nucleotide sequencing 

Reverse transcription (RT)-PCR was performed with the 
enzymes, ReverTra Dash and KOD FX (Toyobo 
Biochemicals, Osaka, Japan). All amplification reactions, 
except those amplifying nev- tRNA Gly (CCC), consisted of 
30 cycles of denaturation at 98° C for 10 s, annealing at 
55°C for 3 s and extension at 74°C for 6 s. PCR of 
nev- tRNA Gly was performed with stepdown amplification, 
consisting of 15 cycles of denaturation at 98°C for 10 s, 
annealing at 74°C (5 cycles), 72°C (5 cycles), 70°C (5 
cycles) and 60° C (15 cycles) for 6 s, and extension at 
74°C for 6 s. The PCR products were separated by elec- 
trophoresis on 3% NuSieve 3 : 1 agarose gels (Cambrex 
Bio Science, Rockland, ME), which were stained with 
ethidium bromide. The bands were visualized with a 
Molecular Imager FX Pro (Bio-Rad Laboratories). The 
primers used in this analysis are summarized in 
Supplementary Table SIB. The RT-PCR products were 
further purified with an Illustra GFX PCR DNA and Gel 
Band Purification Kit (GE Healthcare), and subcloned 
into the pCR-Blunt II-TOPO vector (Invitrogen). The 
nucleotide sequences of the inserted DNAs were 
determined on an ABI PRISM 3100 DNA Sequencer 
(Applied Biosystems, Foster City, CA, USA). 

Preparation of recombinant aminoacyl-tRNA synthetases 
from C. elegans 

Caenorhabditis elegans total RNA was used as the 
template for RT-PCR of four genes encoding aminoacyl- 
tRNA synthetases (aaRSs): glycyl-tRNA synthetase 
(GlyRS, GenBank accession no. NP_871640.1), 
isoleucyl-tRNA synthetase (IleRS, GenBank accession 
no. NP_501914.1), leucyl-tRNA synthetase (LeuRS, 
GenBank accession no. NP_497837.1) and seryl- 
tRNA synthetase (SerRS, GenBank accession no. 
NP_50 1804.1). RT-PCR was performed using the 
ReverTra Dash Kit and KOD FX (Toyobo 
Biochemicals), and the amplified products were cloned 
into the pET-23 b expression vector (Novagen, Madison, 
WI, USA). Each resulting vector encoded a full-length 
aaRS with a six-histidine tag at its C-terminal end. The 
nucleotide sequences of the inserted DNA fragments were 
determined on an ABI PRISM 3100 DNA Sequencer 
(Applied Biosystems) and confirmed to be identical to 
those in the database. The primers and restriction 
enzymes used for cloning are summarized in 
Supplementary Table SIC. 

The resulting vectors were used to transform E. coli 
strain BL21(DE3)pLysS (IleRS and LeuRS) or 
HMS174(DE3)pLysS (GlyRS and SerRS). The 
transformants, growing logarithmically at 37°C in 
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Luria-Bertani medium containing 50 ug/ml ampicillin and 
35 ug/ml chloramphenicol, were supplemented with 
0.1-0.4mM isopropyl-p-D-thiogalactopyranoside. After 
14-16 h of further growth at 16°C, the cells were pelleted 
by centrifugation (5000 g for 5min at 4°C) and sonicated 
(3 min) in 1 x phosphate buffered saline (PBS) with 10 mM 
imidazole. The extracts were centrifuged at 12 000g for 
10 min at 4°C to remove any debris, and the recombinant 
proteins were purified with a Proteus IMAC Protein 
Purification Kit (Pro-Chem, Littleton, MA, USA). The 
eluted protein solutions were pooled and gel filtered to 
remove any salt on a HiTrap column (GE Healthcare) 
with buffer A (50 mM Tris-HCl [pH 8.0], ImM EDTA, 
0.02% Tween 20, 7mM 2-mercaptoethanol and 10% 
glycerol). 

In vitro aminoacylation assay 

The tRNAs synthesized in vitro with T7 RNA polymerase 
were incubated at room temperature with purified recom- 
binant aaRSs from C. elegans for 0, 2.5, 5, 10, 20, 40 and 
60 min in a 25 ul reaction mixture containing 50 mM 
Hepes (pH 7.2), 50 mM KC1, 4mM ATP, 15mM 
MgCl 2 , 3mM dithiothreitol, 15mM amino acid (glycine, 
isoleucine, leucine, or serine), 3 ug of tRNA and 1 ug of 
aaRS (except IleRS, for which we used 5 ug to accommo- 
date its lower aminoacylation efficiency). The reaction was 
stopped by the addition of an equal volume of acid-urea 
stop buffer (23). The charged tRNAs were separated on 
acid-urea 6.5% polyacrylamide gels, which were stained 
with SYBR Green II (Lonza, Rockland, ME, USA). The 
bands were detected with a Molecular Imager FX Pro 
(Bio-Rad Laboratories). 

Acid-urea PAGE/northern blot analysis 

Alkali-treated total RNA (100 ug; see 'Preparation of total 
RNA' section) was used in the in vitro aminoacylation 
assay, with purified recombinant LeuRS and GlyRS 
titrated from 0.01 to 1 ug. The assay products were 
separated on acid-urea 6.5% polyacrylamide gels, 
followed by northern blot hybridization to detect the in- 
dividual tRNAs. 

Construction of expression vectors for cell-free protein 
synthesis 

For in vitro transcription/translation, we prepared two 
expression vectors encoding PF1549 protein (RNA 
3'-terminal-phosphate cyclase of Pyrococcus furiosus 
(24), GenBank accession no. NP_579278) and firefly 
luciferase (GenBank accession no. ACF93 193.1). 
Pyrococcus furiosus genomic DNA and luciferase ICE 
T7 Control DNA (Promega, Madison, WI, USA) were 
used as the templates for the PCR of PF1549 (24) and 
luciferase, respectively. PCR was performed with KOD 
FX (Toyobo Biochemicals), and the amplified products 
were cloned into the pF25A ICE T7 Flexi Vector 
(Promega) containing the T7 promoter. The resulting ex- 
pression vectors encoded full-length PF1549 or luciferase, 
with six-histidine tags at their C-terminal ends. The nu- 
cleotide sequences of the inserted DNA fragments were 



determined on an ABI PRISM 3100 DNA Sequencer 
(Applied Biosystems) and confirmed to be identical to 
those in the database. The primers and restriction 
enzymes used for cloning are summarized in 
Supplementary Table SIC. 

In vitro translation analysis 

Each 46 ul cell-free protein expression reaction mixture, 
with coupled transcription (driven by the T7 promoter) 
and translation, contained 40 ul of cell extract from the 
Spodoptera frugiperda Sf21 cell line (Promega), 4ug of 
the T7 expression vector encoding PF1549 protein or 
firefly luciferase and 4ug of in vzYro-transcribed tRNA 
[tRNA Leu (AAG) or nev-tRNA Gly (CCC)]. After incuba- 
tion at 29° C for 4h, the cell ly sates were sonicated (1 min) 
with a Polytron sonicator (Kinematica, Bohemia, NY, 
USA). The samples containing the in v/fro-translated 
PF1549 proteins were incubated at 80° C for 15 min to 
destroy any endogenous insect proteins. The homogenates 
were centrifuged at 13 000g for 10 min at 4°C to remove 
any debris, and the His-tagged recombinant proteins were 
purified with a Talon Magnetic Beads Buffer Kit 
(Clontech Laboratories, Mountain View, CA, USA). 

Nano liquid chromatography-tandem mass spectrometry 
(NanoLC-MS/MS) analysis 

The purified proteins were extracted with lOOmM Tris- 
HCl (pH 9.0) containing 12mM sodium deoxycholate 
(SDC) and 12 mM sodium lauroyl sarcosinate (SLS), 
then reduced with lOmM dithiothreitol at room tempera- 
ture for 30 min and alkylated with 55 mM iodoacetamide 
in the dark at room temperature for 30 min. The samples 
were diluted 5-fold with 50 mM ammonium bicarbonate 
and digested with MS-grade lysyl endoprotease, followed 
by overnight trypsin digestion. The SDC and SLS were 
removed from the sample solutions (25,26) by the addition 
of ethyl acetate, acidification with 0.5% trifluoroacetic 
acid, shaking for 1 min, and centrifugation at 15 700g 
for 2 min. The aqueous phase was desalted with 
StageTips on a C18 Empore Disk (27). 

An LTQ-Orbitrap (Thermo Fisher Scientific, Bremen, 
Germany) equipped with a nanoLC interface (Nikkyo 
Technos, Tokyo, Japan) and a Dionex Ultimate3000 
pump with a FLM-3000 flow manager (Dionex, 
Germering, Germany) and an HTC-PAL autosampler 
(CTC Analytics, Zwingen, Switzerland) were used for 
nanoLC-MS/MS measurements. Reprosil CI 8 materials 
(3 urn; Dr Maisch, Ammerbuch, Germany) were packed 
into a self-pulled needle (150 mm length, 375 urn 
O.D. x 100 um I.D.) with a nitrogen-pressurized column 
loader cell (Nikkyo Technos, Tokyo, Japan) to prepare an 
analytical column needle with a 'stone-arch' frit (28). The 
flow rate was 500 nl/min. The mobile phases consisted of 
(A) 0.5% acetic acid and (B) 0.5% acetic acid and 80% 
acetonitrile. A three-step linear gradient was used: 5-10% 
B in 5 min, 10^10% B in 60 min, 40-100% B in 5 min and 
100% B for 10 min. A spray voltage of 2400 V was 
applied. The MS scan range was m/z 300-1500 in the 
Orbitrap analyzer and the top 10 precursor ions were 
selected for subsequent MS/MS scans in the ion trap. 
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Data analysis for protein identification 

The raw data files were analyzed with Mass Navigator 
vl.2 (Mitsui Knowledge Industry, Tokyo, Japan) to 
create peak lists based on the recorded fragmentation 
spectra. The peptides were identified with Mascot v2.2 
(Matrix Science, London, UK) in a comparison with an 
in-house protein database containing PF1549, luciferase 
and keratin proteins and trypsin sequences. A precursor 
mass tolerance of 3ppm and a fragment ion mass tol- 
erance of 0.8 Da were used with strict trypsin speci- 
ficity, allowing up to two missed cleavages (29). 
Carbamidomethylation of cysteine was set as a fixed 
modification, and methionine oxidation was allowed as a 
variable modification. Peptides were rejected if the Mascot 
score was below the 99% confidence limit based on the 
'identity' score of each peptide. False-positive rates were 
estimated by comparison with a randomized decoy 
database created with the Mascot Perl program and 
supplied by Matrix Science. 

RESULTS AND DISCUSSION 

Nematode-speciflc Class II tRNAs 

Although a set of unique Class II tRNA-like sequences 
with Class I tRNA anticodons has been identified in the 
genome of C. elegcms, the evolutionary and functional im- 
plications of these tRNA genes remain unclear. To assess 
whether these tRNAs are conserved in the eukaryotes and 
to determine their possible functions, we collected the se- 
quences of 49 872 tRNAs from 44 eukaryote species regis- 
tered in the Genomic tRNA Database (19). We also 
obtained the genomic sequences of C. japonica and 
P. pacificus from the University of California Santa 
Cruz (UCSC) Genome Bioinformatics database (30). 
Application of tRNAscan-SE (20) with default parameters 
predicted an additional 2138 tRNA genes. We screened 
these 52 010 tRNA sequences for candidate Class II 
tRNAs that contain the conserved basic tRNA features 
(1,18), including a 7-bp acceptor stem, a 3-4-bp D-stem, a 



5-bp anticodon stem, a 5-bp T-stem, a 7-nt T-loop, a 7-nt 
anticodon loop and a Class I tRNA anticodon specifying 
an amino acid other than leucine or serine. We identified 
a total of 115 candidate tRNAs from six nematode 
species: C. brenneri, C. briggsae, C. elegans, C. japonica, 
C. remanei and P. pacificus (summarized in 
Supplementary Data set SI). All the candidate tRNAs 
were observed in nematodes, with none in any other eu- 
karyotic genome. Therefore, we designated these unusual 
Class II tRNAs 'nematode-specific V-arm-containing 
tRNAs' (nev-tRNAs). 

When we assessed the distribution of the predicted 
nev-tRNA genes among the six nematode species 
(Figure 1), we found that three species, C. remanei, 
C. briggsae and C. brenneri, encoded an average of 
31 nev-tRNAs each, with C. brenneri encoding 56 
nev-tRNAs, corresponding to seven different anticodons. 
This indicates the rapid development of many nev-tRNAs. 
In contrast, C. elegans, C. japonica and P. pacificus each 
encoded only 6-8 nev-tRNA genes. These three species 
diverged earlier within the nematode lineage than the 
former three species, suggesting that the number of 
nev-tRNAs and their anticodon variations have increased 
during the evolution of the nematode taxon, and are es- 
pecially prominent in C. brenneri. These findings also 
indicate that nev-tRNA Gly (CCC) and nev-tRNA Ile 
(UAU) are the two major types of nev-tRNAs and are 
widely conserved among the nematode genomes. 
Intriguingly, nev-tRNA Gly s (CCC), found only in 
C. briggsae, C. brenneri and C. elegans, are the only 
tRNAs that correspond to the codon GGG (shown as 
red boxes in Figure 1). Although all nematode genomes 
encode a Class I tRNA Ile (UAU) containing a single 
intron, most tRNAs corresponding to the UAU anticodon 
are nev-tRNA Ile s (UAU) (shown as yellow boxes in 
Figure 1). Furthermore, three nev-tRNAs, Pro(GGG), 
Arg(CCU) and Arg(UCU), in C. brenneri constitute the 
entire population of tRNAs that possess these cognate 
anticodons. Therefore, the abundance and evolutionary 
conservation of nev-tRNAs suggest that they play import- 
ant roles in cellular processes. 
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Figure 1. Evolutionary conservation of the nev-tRNAs in nematodes. For each nematode species, the ratios of nev-tRNA genes to tRNA genes 
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Sequence characteristics of nev-tRNAs and their 
evolutionary background 

We selected three nematode species to examine the origins 
and evolution of the nev-tRNA genes: C. brenneri (which 
has more nev-tRNAs than any other species), C. elegans 
(used for experimental verification, as described below) 
and P. pacificus (representing the earliest branch within 
the nematode lineage). Using 151 tRNA exons, including 
the exons of 13 nev-tRNAs from these three nematode 
species, we performed a phylogenetic analysis of the 
non-redundant tRNA sequences (Supplementary Figure 
SI). We found that four types of nev-tRNAs, Ile(UAU), 
Gly(CCC), Arg(CCU) and Lys(CUU), clustered into a 
Class II tRNA clade, especially close to the tRNA Leu 
(UAA) family. Moreover, the Class I tRNAs with these 
same anticodons clustered separately from these 
nev-tRNAs on the phylogenetic tree of nematode 
tRNAs. Because nev-tRNA Ile (UAU) and nev-tRNA Gly 
(CCC) are widely conserved in the nematode linage 
(Figure 1), these results suggest that at least the major 



nev-tRNAs probably evolved from Class II tRNAs, 
rather than by gene duplication of synonymous tRNAs. 
In contrast, six types of nev-tRNAs in C. brenneri clus- 
tered within the Class I tRNA family, suggesting that 
nev-tRNAs emerged on two independent occasions 
during tRNA evolution. We also found that a single 
C. brennneri tRNA Leu (UAA) occurred within the 
nev-tRNA clade. Its sequence was almost identical 
(91.6%) to that of C. brennneri nev-tRNA Arg (CCU), 
but was only 55.9% similar to the synonymous tRNA Leu 
(UAA), indicating that it originated from the nev-tRNA 
gene (shown as an asterisk in Supplementary Figure SI). 

To analyze the characteristics of the nev-tRNA se- 
quences relative to their nucleotides at each position, we 
compared C. elegans nev- tRNA Gly (CCC) and 
nev-tRNA Ile (UAU) with their synonymous Class I 
tRNA Gly (GCC), tRNA Ile (UAU) and tRNA Leu (UAA) 
sequences (Figure 2). Each of these nev-tRNAs forms a 
cloverleaf secondary structure, has endogenous A-box 
(5'-TGGCNNAGTGG-3') and B-box (5'-GGTCGA 
NNCC-3') promoters (31), and contains a V-arm of 
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Figure 2. Comparison of nucleotide sequences and secondary structures of two nev-tRNAs and their related tRNAs in C. elegans. Nucleotides 
conserved between nev- tRNA Gly (CCC) and the synonymous Class I tRNA Gly (GCC) are shown in blue; those between nev-tRNA e (UAU) and the 
synonymous Class I tRNA Ile (UAU) are shown in yellow; and those between the two nev-tRNAs and the Class II tRNA Leu (UAA) are shown in red. 
Nucleotides conserved among the three V-arm-containing tRNAs are boxed. Sequence similarities are also shown for each pair of tRNAs. 
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Figure 3. Expression of nev-tRNAs in C. elegans. The results of the 
northern blot analysis of the three nev-tRNA genes found in C. elegans, 
nev-tRNA Gly (CCC), nev-tRNA Ile (UAU) and nev-tRNA Lys (CUU), 
and their synonymous Class I tRNA genes, tRNA Gly (GCC) and 
tRNA lle (UAU) (control), are shown. Black triangles indicate that 
the band sizes were approximately consistent with the predicted 
lengths of the nev-tRNAs based on the bioinformatics approach used 
in this study. nev-tRNA Lys was not detected, even when the amount of 
total RNA was increased to 30 ug per lane. Contrast levels were 
adjusted for the nev-tRNA images. 



15-16 nucleotides that forms a stem-loop structure. We 
found that nev- tRNA Gly (CCC) showed greater sequence 
similarity to tRNA Leu (UAA) (64.3%) than to Class I 
tRNA Gly (GCC) (56.3%). The results for the 
nev-tRNA 1 e (UAU) sequence are similar. The nucleotides 
conserved among these three Class II tRNAs were 
concentrated in the D-arm and V-arm regions, which are 
both recognition elements for LeuRS (4,5) (shown as 
boxed nucleotides in Figure 2). These results indicate 
that the nev-tRNAs may be recognized by LeuRS and 
charged with leucine. 

nev-tRNAs are weakly expressed in C. elegans 

Although our bioinformatics approach indicated that the 
nev-tRNAs probably originated from tRNA Leu and are 
expanding among the nematode genomes, it was unclear 
whether they are expressed in cells. Therefore, we assayed 
the expression of the nev-tRNA genes by northern blot 
analysis of the total RNA extracted from mixed stages 
of C. elegans, including eggs, Ll-4 larvae and adults, 
using specific oligonucleotide probes (Supplementary 
Table SI A). We detected the expression of nev-tRNA y 
(CCC) and nev-tRNA Ile (UAU) at their predicted lengths, 
but no expression of nev-tRNA Lys (CUU) (Figure 3). RT- 
PCR analysis confirmed the expression of all three 
nev-tRNAs at their expected lengths (Supplementary 
Figure S2), with the nucleotide sequences of the amplified 
products identical to those of the nev-tRNAs. Moreover, 
none of these nev-tRNAs contained a V-arm spliced 
tRNA band of ~70bp, indicating that the V-arm 
domain is part of the mature tRNA, not an intronic 
sequence that is spliced out during tRNA processing. 
These results demonstrate that although expressed, the 



levels of nev-tRNAs are lower than those of general 
tRNAs. 

nev- tRNA Gly and nev-tRNA e are specifically 
aminoacylated with leucine 

The V-arm domain and the other determinant sites for 
LeuRS that are conserved in the nev-tRNAs suggest that 
they are leucylated, whereas their anticodons correspond 
to other amino acids. To understand the functions of the 
cellular nev-tRNAs, we determined the amino acids with 
which they are charged. We attempted to aminoacylate C. 
elegans nev-tRNA Gly (CCC) and nev-tRNA Ile (UAU) 
in vitro with four purified recombinant C. elegans aaRSs: 
GlyRS, IleRS, SerRS and LeuRS (Supplementary Figure 
S3). Each control tRNA was successfully aminoacylated 
with the aaRS and the amino acid corresponding to its 
anticodon (Figure 4A, left). Under the same conditions, 
nev-tRNA Gly (CCC) and nev-tRNA Ile (UAU) were not 
aminoacylated with the amino acids corresponding to 
their respective anticodons, nor with serine, but with 
leucine in the presence of LeuRS (Figure 4A, right). 
When we assessed the aminoacylation efficiency and spe- 
cificity of each nev-tRNA at different time points, we 
found that both nev-tRNAs had a lower leucylation effi- 
ciency than tRNA Leu (AAG) and that, conversely, almost 
no tRNA was aminoacylated by GlyRS or IleRS 
(Figure 4B). 

Because many examples have so far been reported of 
tRNA modifications that affect the recognition by aaRSs 
(32,33), we conducted further aminoacylation assays using 
total RNA fractions, followed by the identification of the 
mature nev-tRNAs by northern blot analysis, to validate 
the leucylation of native nev-tRNAs in vivo. Because the 
expression levels of native nev-tRNAs are low, we 
designed an oligonucleotide complementary to full-length 
nev-tRNA Gly as the northern hybridization probe 
(Supplementary Table SI A) to improve the signal 
strength. We used the most stringent hybridization condi- 
tions to detect nev-tRNA Gly with the full-length probe and 
found that almost all the cross-hybridization signals could 
be reduced by increasing the hybridization and wash tem- 
peratures above 55°C (Figure 5A). Accordingly, we per- 
formed the hybridization reactions for in vivo nev-tRNAs 
at 60°C and successfully demonstrated that native 
nev-tRNA Gly is aminoacylated with leucine but not 
glycine (Figure 5B). These results suggest that 
nev-tRNAs are only ever charged with leucine both 
in vitro and in vivo. 

To determine the nucleotides in the nev-tRNAs 
recognized by LeuRS, we tested the ability of LeuRS to 
aminoacylate nev- tRNA Gly (CCC) variants with muta- 
tions at several recognition sites. The V-arm domains 
are major determinants of the recognition of Class II 
tRNAs (4-7), and X-ray structural analysis of the 
Pyrococcus horikoshii LeuRS-tRNA Leu complex showed 
that the protruding C-terminal domain of LeuRS specif- 
ically recognizes the bases at the tip of the V-arm (34). 
Caenorhabditis elegans encodes five synonymous 
tRNA Leu genes, and the V-arm sequences of its 
nev-tRNAs are most similar to that of tRNA Leu (UAA), 
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Figure 4. Aminoacylation assay of nev-tRNAs. (A) In v/fro-transcribed tRNAs [four types of control tRNAs, nev-tRNA Gly (CCC) and nev-tRNA Ile 
(UAU)] were aminoacylated using four recombinant C. elegans aaRSs: GlyRS, IleRS, SerRS and LeuRS (Supplementary Figure S3). The 
aminoacylated tRNAs (aa-tRNAs) were separated on an acid-urea polyacrylamide gel and stained with SYBR Green II. (B) Measurement of the 
aminoacylation efficiencies of the nev-tRNAs at 0, 2.5, 5, 10, 20, 40 and 60 min. (Left panel) Leucylation of tRNA Leu (AAG), tRNA Ser (CGA) and 
two nev-tRNAs by LeuRS. (Middle panel) Glycylation of tRNA Gly (GCC) and nev-tRNA Gly (CCC) by GlyRS. (Right panel) Isoleucylation of 
tRNA Ile (UAU) and nev-tRNA Ile (UAU) by IleRS. 
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Figure 5. In n'vo-modified nev-tRNA Gly (CCC) is specifically charged with leucine. (A) Validation of the hybridization stringency conditions for 
detecting nev-tRNA Gly (CCC) using the full-length probe. In W/ro-transcribed nev-tRNA Gly (CCC), tRNA Gly (GCC) and tRNA Leu (AAG) were 
detected using an oligonucleotide fully complementary to nev-tRNA Gly . Northern hybridization and washing treatments were performed at 50, 55, 60 
and 65°C. (B) The aminoacylation of native nev- tRNA Gly (CCC) determined by acid- urea PAGE/northern hybridization. An aminoacylation assay 
was performed at room temperature for 60 min with purified recombinant LeuRS or GlyRS titrated from 0.01 to 1 ug. 



with the bases at the tip of the nev-tRNA V-arms (U47b 
and A47c) conserved in both tRNA Leu s (UAA/CAA) 
(Figure 6A). Complete deletion of the nev-tRNA Gly 
(CCC) V-arm domain led to a severe loss of leucylation 
(Figure 6B, Ml), and replacement of the 2nt at the tip of 
the V-arm reduced the leucylation efficiency ~2-fold 
(Figure 6B, M2). This suggests that the V-arm domains 
of the nev-tRNAs, primarily the 2 nt at their tips, are ne- 
cessary for leucylation by C. elegans LeuRS, as are the 
determinants in tRNA Leu . In contrast, the discriminator 
base A73 of tRNA Leu is highly conserved throughout all 



living organisms and is important for leucylation in E. coli 
(35), yeast (36) and humans (4). A73 is also conserved in 
all tRNA Leu genes in the six nematode species examined, 
as well as in all 115 nev-tRNAs, whereas the Class II 
tRNA Ser genes contain a conserved G73 (Figure 6A). 
We found that the leucylation efficiency of nev- tRNA Gly 
(CCC) was strongly influenced by base substitutions at 
A73 (Figure 6B, M3). The strong evolutionary conserva- 
tion of A73 and the V-arm suggest that nev-tRNAs have 
been under evolutionary pressure to be specifically 
recognized by LeuRS. 
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Figure 6. Mutagenesis of nev- tRNA Gly (CCC) sites recognized by LeuRS. (A) Sequence alignments of the V-arm domains of Class II tRNAs, 
including nev-tRNAs, in C. elegans. The nucleotide positions are numbered as described previously (1). (B) (Left panel) The nev-tRNA Gly (CCC) 
variants used in this study. (Right panel) Leucylation efficiencies of tRNA Leu (AAG), wild-type nev-tRNA Gly (WT) and the three nev- tRNA Gly 
variants (Ml-3). Each experiment was performed in duplicate to confirm the reproducibility of the enzymatic reactions. 



nev-tRNA Gly is incorporated into eukaryotic ribosomes 
during translation and decodes the GGG codon as leucine 
in vitro 

Finally, to demonstrate whether nev-tRNAs are 
incorporated into ribosomes and function in translation, 
we used these nev-tRNAs in an in vitro translation assay 
and determined the sequences of the resultant peptides 
with MS. In these experiments, we attempted to synthesize 
two proteins, PF1549 (24) and firefly luciferase, because (i) 
their mRNAs contain 6-8 GGG codons, corresponding to 
glycine; and (ii) PF1549, an RNA 3'-terminal-phosphate 
cyclase in the hyperthermophilic archaeon Pyrococcus 
furiosus, can be purified easily by heat treatment. The 
in vitro translation reactions were performed in an insect 
cell-free protein expression system using in vitro- 
transcribed tRNAs, either tRNA Leu (AAG) (control) or 
nev-tRNA Gly (CCC). After partial purification with a 
magnetic separator, SDS/PAGE analysis showed that 
the PF1549 protein and luciferase were successfully 
synthesized at their expected sizes (Supplementary 
Figure S4). To examine whether specific leucines, derived 
from the nev-tRNA decoding of GGG codons, were con- 
tained in the proteins synthesized in the presence of 
nev- tRNA Gly , we analyzed the purified proteins with 
nanoLC-MS/MS (summarized in Supplementary Data 
set S2). Surprisingly, proteins containing leucine residues 
translated from GGG codons were only observed in the 
presence of nev- tRNA Gly , in both PF1549 (Table 1) and 
luciferase (Supplementary Table S2 and Supplementary 
Figure S5). Although leucine is indistinguishable from iso- 
leucine on MS, these amino acid residues were almost cer- 
tainly leucines because in vitro aminoacylation assays 
showed that nev-tRNA Gly is not charged with isoleucine. 
Peptides containing glycine translated from GGG codons 
were also observed in the presence of nev-tRNA Gly , sug- 
gesting that the GGG codon is also decoded by endogen- 
ous insect tRNA Gly (UCC) via G-U wobble base pairing 
(37). These results strongly suggest that in insect cells, (i) 
nev-tRNA Gly is aminoacylated with leucine; (ii) leucylated 



Table 1. In vitro translation analysis using nev- tRNA Gly 



Sequence (PF1549) Mascot score (n = 4) 



tRNA Leu (control) nev-tRNA Gly 



GGGIVAGYVKPWIER 


45.0 


37.4 


GGG I VAGYVKPWI ERK 


57.6 
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73.6 


KLANAKVEGAEVGSR 


75.1 
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20.1 
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20.3 


LKPAEEVGR 




30.5 


LKPAEEVGREAAQELLSQVK 




42.1 


VEGAEVLSR 




63.0 



The PF1549 protein was translated in vitro in the presence of tRNA Le " 
(AAG) or nev-tRNA Gly (CCC). Peptide sequences with the amino acid 
residues (underlined) arising from the decoding of the GGG codon are 
listed. Mascot scores were determined from four independent 
experiments. 



nev-tRNA Gly is incorporated into ribosomes during trans- 
lation; and (iii) leucylated nev-tRNA Gly recognizes GGG 
codons, resulting in the incorporation of leucine, which 
does not correspond to their anticodons. 

nev-tRNAs tend to correspond to rare codons 

In this work, we have identified a new group of Class II 
tRNA genes, which probably originated and diversified 
from the tRNA Leu gene. These unique tRNAs are found 
only in several nematode species, and their numbers and 
the types of anticodons involved have tended to increase 
during nematode evolution. On the basis of the codon 
usage of C. elegans, we found that the major nev-tRNAs 
correspond to the three rarest codons in the organism, 
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Gly(GGG), Pro(CCC) and Arg(AGG), which comprise 
0.44, 0.44 and 0.38%, respectively, of the codons in 
C. elegans (Supplementary Figure S6). The usage of 
codon Ile(ATA) is lowest among the three synonymous 
isoleucine codons (0.94%), but is higher than that of the 
other rare codons. This may be attributable to the coex- 
istence of both Class I tRNA Ile (UAU) and nev-tRNA Ile 
(UAU) in the nematode genomes (Figure 1). Because iso- 
leucine and leucine are chiral amino acids with highly 
similar chemical properties, the penetration of 
nev-tRNA against the Ile(ATA) codon would have the 
least effect on the proteome. This hypothesis is also sup- 
ported by the findings that other major nev-tRNAs tend 
to selectively penetrate the rare codons and are weakly 
expressed in the cell. 

Recent studies have indicated that tRNAs have alterna- 
tive functions associated with the stress responses (38,39). 
In Gram-negative bacteria, such as E. coli, uncharged 
tRNAs block protein synthesis by penetrating the A site 
of the ribosome in response to amino acid starvation (38). 
In mammalian cells, the response to oxidative stress 
involves the selective mismethionylation of non-methionyl 
tRNAs, with the resulting Met-misacylated tRNAs used 
in translation (39). These data suggest that nev-tRNAs 
have non-canonical roles under certain conditions, 
indicating the need for additional research into their 
expression patterns under nutritional, heat and oxida- 
tive stress, and at different developmental stages. In 
conclusion, we have shown here that: (i) nev-tRNAs 
are present only in the nematode lineage and tend to 
correspond to rare codons; (ii) nev-tRNAs are weakly 
expressed in C. elegans under laboratory breeding 
conditions; (iii) nev-tRNAs are only charged with 
leucine in vitro; and (iv) nev-tRNAs are incorporated 
into eukaryotic ribosomes during translation and 
decode rare codons to leucine in vitro, contrary to the 
genetic code. These results may provide further 
insight into the evolution of alternative genetic codes 
in eukaryotes. Because the precise biological roles 
of nev-tRNAs in nematode cells remain unclear, 
future studies are required to identify their functions 
in vivo. 
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